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EMTERE 

3 <110> APPLICANT: CAROL, Pierre 

4 KUNTZ, Marcel 

7 <120> ^Se'ofI^ENTION: CDNA SEQUENCE TRANSCRIBING AN mRNA ENCODING THE TERMINAL 

8 ASSOCIATED WITH CAROTENOID BIOSYNTHESIS , AND USES THEREOF 
10 <130> FILE REFERENCE: 109326 

12 <140> CURRENT APPLICATION NUMBER: US 09/807,867 

13 <141> CURRENT FILING DATE: 2001-06-15 

15 <150> PRIOR APPLICATION NUMBER: PCT/IB99/01719 

16 <151> PRIOR FILING DATE: 1999-10-20 

18 <150> PRIOR APPLICATION NUMBER: FR 9813283 

19 <151> PRIOR FILING DATE: 1998-10-20 
21 <160> NUMBER OF SEQ ID NOS : 10 

23 <170> SOFTWARE: Patentln version 3.0 

25 <210> SEQ ID NO: 1 

26 <2U> LENGTH: 1396 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Arabidopsis thaliana 

30 <400> SEQUENCE : 1 _____ ™,^tnt ocatttctac 60 



31 ccgctcacat tgggattcgt cattcttctt ctaaaacccg caaaatttct ccatttctac 
33 caaaaatatc caacttttac ttttctttcc tgtgaaatta tctgctcaaa tctttggttc 
35 ctgacggaga tggcggcgat ttcaggcatc tcctctggta cgttgacgat ttcacggcct 
37 ttggttactc ttcgacgctc tagagccgcc gtttcgtaca gctcctctca ccgattgctt 
39 catcatcttc ctctctcttc tcgtcgtctg ctattaagga acaatcatcg agtccaagca 
41 acgattttgc aagacgatga agagaaagtg gtggtggagg aatcgtttaa agccgagact 
43 tctactggta cagaaccact tgaggagcca aatatgagtt cttcttcaac tagtgctttt 
45 gagacatgga tcatcaagct tgagcaagga gtgaatgttt tccttacaga ctcggttatt 
47 aagatacttg acactttgta tcgtgaccga acatatgcaa ggttctttgt tcttgagaca 
49 attgctagag tgccttattt tgcgtttatg tctgtgctac atatgtatga gacctttggt 
51 tggtggagga gagcagatta tttgaaagta cactttgctg agagctggaa tgaaatgcat 
53 cacttgctca taatggaaga attgggtgga aattcttggt ggtttgatcg ttttctggct 
55 cagcacatag caaccttcta ctacttcatg acagtgttct tgtatatctt aagccctaga 
57 atggcatatc acttttcgga atgtgtggag agtcatgcat atgagactta tgataaattt 
59 ctcaaggcca gtggagagga gttgaagaat atgcctgcac cggatatcgc agtaaaatac 
61 tatacgggag gtgacttgta cttatttgat gagttccaaa catcaagaac tcccaatact 
63 cgaagaccag taatagaaaa tctatacgat gtgtttgtga acataagaga tgatgaagca 
65 gaacactgca agacaatgag agcttgtcag actctaggca gtctgcgttc tccacactcc 
67 attttagatg atgatgatac tgaagaagaa tcagggtgtg ttgttcctga ggaggctcat 
69 tqcgaaggta ttgtagactg cctcaagaaa tccattacaa gttaataaat tagaaagtaa 
71 actaaaaaag attatttgta tcagctcatg aacaatagat ataatcccat atacttggga 
7 3 ataaaggaat aatgtgaaat tcccatcgtt gtgctagtgt gtgagagaat caaataccct 
75 aatgatgtaa atgtactttg atgagcttaa gtcgttgtag accattttat caaaaaaaaa 
77 aaaaaaaaaa aaaaaa 

80 <210> SEQ ID NO: 2 

81 <211> LENGTH: 351 

82 <212> TYPE: PRT 

83 <213> ORGANISM: Arabidopsis thaliana 
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85 <400> SEQUENCE: 2 ■ 
87 Met Ala Ala lie Ser Gly lie Ser Ser Gly Thr Leu Thr lie Ser Arg 



88 1 
90 



5 10 15 

Pro Leu Val Thr Leu Arg Arg Ser Arg Ala Ala Val Ser Tyr Ser Ser 
91 20 25 30 

93 Ser His Arg Leu Leu His His Leu Pro Leu Ser Ser Arg Arg Leu Leu 

94 35 40 45 

96 Leu Arg Asn Asn His Arg Val Gin Ala Thr He Leu Gin Asp Asp Glu 

97 50 55 60 

99 Glu Lys Val Val Val Glu Glu Ser Phe Lys Ala Glu Thr Ser Thr Gly 

100 65 70 75 80 
102 Thr Glu Pro Leu Glu Glu Pro Asn Met Ser Ser Ser Ser Thr Ser Ala 

85 90 95 

105 Phe Glu Thr Trp He He Lys Leu Glu Gin Gly Val Asn Val Phe Leu 

106 100 105 

108 Thr Asp Ser Val lie Lys He Leu Asp Thr Leu Tyr Arg Asp Arg Thr 

109 115 12° 125 

111 Tyr Ala Arg Phe Phe Val Leu Glu Thr He Ala Arg Val Pro Tyr Phe 

H4 Ala Phe Met Ser Val Leu His Met Tyr Glu Thr Phe Gly Trp Trp Arg 
115 145 150 I 55 lbU 

117 Arg Ala Asp Tyr Leu Lys Val His Phe Ala Glu Ser Trp Asn Glu Met 

165 170 175 

120 His His Leu Leu He Met Glu Glu Leu Gly Gly Asn Ser Trp Trp Phe 

121 180 185 I 90 

123 Asp Arg Phe Leu Ala Gin His He Ala Thr Phe Tyr Tyr Phe Met Thr 

124 195 200 205 

126 Val Phe Leu Tyr He Leu Ser Pro Arg Met Ala Tyr His Phe Ser Glu 

127 210 215 220 

129 Cys Val Glu Ser His Ala Tyr Glu Thr Tyr Asp Lys Phe Leu Lys Ala 

130 225 230 235 

132 Ser Gly Glu Glu Leu Lys Asn Met Pro Ala Pro Asp He Ala Val Lys 

133 245 250 255 



135 Tyr Tyr Thr Gly Gly Asp Leu Tyr Leu Phe Asp Glu Phe Gin Thr Ser 

136 260 265 270 

138 Arg Thr Pro Asn Thr Arg Arg Pro Val He Glu Asn Leu Tyr Asp Val 

139 275 280 285 

141 Phe Val Asn He Arg Asp Asp Glu Ala Glu His Cys Lys Thr Met Arg 

142 290 295 300 

144 Ala Cys Gin Thr Leu Gly Ser Leu Arg Ser Pro His Ser He Leu Asp 

145 305 310 315 

147 Asp Asp Asp Thr Glu Glu Glu Ser Gly Cys Val Val Pro Glu Glu Ala 

325 330 

150 His Cys Glu Gly He Val Asp Cys Leu Lys Lys Ser lie Thr Ser 

151 340 345 350 

153 <210> SEQ ID NO: 3 

154 <211> LENGTH: 1387 

155 <212> TYPE: DNA 

156 <213> ORGANISM: capsicum 
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158 <400> SEQUENCE : 3 .trtar.tata aqttttcgaa 60 

120 



159 ^"'aaaaaaa tcaagaatgg cgatttccat atctgctatg agttttcgaa 
i fil cttcaotttc ttcttcatat tcagcatttt tgtgcaattc caagaaccca ttttgtttga 
^3 attctctat? ttcacttagg aattctcata gaacttttca gccttcgtta tcaaggaaat 
ill caagtagagt tcgagcaacg ttgttaaaag agaatgaaga agaagtggtt gtggagaaat 
^67 cttScacc taagagttt? cctggtaatg tgggaggggg aaataatggg W°« c «9 
169 ataatCatc ctcgaacggt ctggagaaat gggttataaa gattgagcag tctgtaaata 
111 tc«tc?cac gga?tcag?g ataaagattc ttgacacttt gtatcacgac cgacactatg 
cqaggttt^t cgttctggaa acaattgcaa gagttcctta ttttgcattt «tatctgtte 

™ SEE 223= SSSE SSSS 

S ssbs »=« ssss ^ is 

183 oatacaaaac ttacgataaa ttcatcaagg atcaagaagc ggaattgaag aaattgcccg 
is5 cScaaagat ^gcagtgagc tactacaccg gaggtgactt gtatttgttc ^tgagtttc 
ill aaacatcacg agagcctaat actcgaaggc caaaaataga taatctgtac gacgtattca 
189 JgaacKcag aga^gacgaa gcagagcatt gtaagacaat gaaagcgtgt caaacccatg 
191 aaaacctccg ctcccctcac acaaatccat gcgatgagtc tgaagacgat ccaggttgtt 
13 cag?gcctca ggccgattgt gtaggtatcg tggattgtat aacgaaatct £cgctgatc 
195 ctaacgtcgg cagaaggtag ggaaaggaaa jacgcagaac saaactatac atgtatatac 
197 cagtacagcc aaatatacaa gaaat°ataca tacatattgt atcttttact ^ctgagg 
199 gagcttgtca aattgcccaa aaaatgggta ggcacttggt tttgttttca cctttcaata 
111 ItStacta aactatgaac aaatttgctc cggcacacta caactccata ggggtcctgt 
201 atttgtacua ddLLauya ? t tctcaqcaaa aaaaaaaaaa 1380 

203 tacgcttctg aactaaattt taacatattt ttgtcaacct tcccagca 

205 aaaaaaa 

208 <210> SEQ ID NO: 4 

209 <211> LENGTH: 357 

210 <212> TYPE: PRT 

211 <213> ORGANISM: capsicum 

213 <400> SEQUENCE: 4 s 
215 Met Ala lie Ser He Ser Ala Met Ser Phe Arg Thr Ser Val Ser Ser 

III Ser Tyr Ser Ala Phe Leu Cys Asn Ser Lys Asn Pro Phe Cys Leu Asn 



25 3° 
s Arg Thr Phe 
40 45 
Ser Arg Lys Ser Ser Arg Val Arg Ala Thr Leu Leu Lys Glu Asn Glu 

50 55 60 

Glu Glu Val Val Val Glu Lys Ser Phe Ala Pro 

228 65 70 75 

Asn val Gly Gly Gly Asn Asn Gly Glu Pro Pro 

on 85 90 n 

233 Asn Gly Leu Glu Lys Tr P Val lie Lys lie Glu Gin Ser Val Asn He 

Leu Asp Thr 
120 125 



22 9 1 Ser Leu Phe Ser Leu Arg Asn Ser H~is Arg Thr Phe Gin Pro Ser Leu 
222 

224 Ser Arc 

227 Glu G°iu Val Val Val Glu Lys Ser Phe Ala Pro Lys Ser Phe Pro Gly 



230 Asn val Gly Gly Gly Asn Asn Gly Glu Pro Pro Asp Asn Ser Ser Ser 

231 

233 Asn Gly Leu Glu 

236 Phe Leu Thr Asp Ser Val He Lys ill Leu Asp Thr Leu Tyr His Asp 

237 

239 Arg His 

2I2 Tyr III Ala Phe He Ser Va'l Leu His Leu Tyr Gin Ser Phe Gly Trp 
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150 155 
III Trp Arg Arg Ala Asp Tyr Leu Lys Val His Phe Ala Glu Ser Trp Asn 

165 170 1/3 

III Glu Met His His Leu Leu lie Met Glu Glu Leu Gly Gly Asn Ala Trp 
249 180 185 190 

251 Trp Phe Asp Arg Phe Leu Ala Gin His lie Ala Val Phe Tyr Tyr Phe 

252 195 200 205 

254 Met Thr Val Ser Met Tyr Ala Leu Ser Pro Arg Met Ala Tyr His Phe 

255 210 215 220 

257 Ser Glu Cys Val Glu His His Ala Tyr Glu Thr Tyr Asp Lys Phe lie 

258 225 230 235 240 

260 
261 



Lys Asp Gin Glu Ala Glu Leu Lys Lys Leu Pro Ala Pro Lys lie Ala 



245 250 



263 Val Ser Tyr Tyr Thr Gly Gly Asp Leu Tyr Leu Phe Asp Glu Phe Gin 
264 
266 



264 260 265 270 

Thr Ser Arg Glu Pro Asn Thr Arg Arg Pro Lys lie Asp Asn Leu Tyr 

269 Asp Val Phe Met Asn lie Arg Asp Asp Glu Ala Glu His Cys Lys Thr 

270 290 295 300 

272 Met Lys Ala Cys Gin Thr His Gly Ser Leu Arg Ser Pro His Thr Asn 

273 305 310 315 

275 Pro Cys Asp Glu Ser Glu Asp Asp Pro Gly Cys Ser Val Pro Gin Ala 

330 335 



276 325 



278 Asp Cys Val Gly He Val Asp Cys He Thr Lys Ser Val Ala As P Pro 

279 340 345 350 

281 Asn Val Gly Arg Arg 

282 355 

284 <210> SEQ ID NO : 5 

285 <211> LENGTH: 1284 

286 <212> TYPE: DNA 

287 <213> ORGANISM: tomato 

289 <400> SEQUENCE: 5 .^^.n- 

290 gaattcggca cgagcggcac gagcagaaaa ctaacaactt tcccactttg gaattttctt 
292 taccttacct aagaagggta ttaatttgat tcttgtggga aggaagaagg Jtcaagaatg 
294 gcgatttcga tttctgctat gagttttgga acctcagttt cttcatattc ttgttttaga 
296 gctaggagtt ttgagaagtc atcagtttta tgcaattccc agaacccatg tcggtttaat 
298 tctgtttttc cgattcggaa atctgatggg gcttcacggt gttctgtttc taggaaatca 
300 tgtagagttc gagcaacgtt gttacaagag aatgaagaag aagtggttgt ggagaaatct 
302 tttgcaccta agagttttcc tgataacgtg ggagggggaa gtaatgggaa gccaccagat 
304 gattcatcct ctaacggtct agagaaatgg gttataaagc ttgagcagtc tgtaaatatc 
306 ttactcacgg attcagtgat aaagattctt gacactttgt atcacaaccg aaactatgcg 
308 aggttttttg ttctggaaac aattgcaagg gttccttatt ttgcatttat atcggttctt 
310 cacatgtatg agagctttgg ctggtggaga agggcagatt atatgaaagt 9«ttttgct 
312 gaaagctgga atgagatgca ccatttgctc attatggaag aattaggggg aaatgcttgg 
314 ?ggt?tgatc gattLttgc acaacatata gctatattct attatttcat gacagtcttg 
316 atgtatgctt tgagcccgag aatggcatat catttctctg aatgtgtgga g^catgca 
318 tacgagactt acgataaatt catcaaggat caaggagagg aattgaagaa tttgcccgct 
320 ccaaagattg cagtggacta ctacacggga ggtgacttat atttatttga tgagtttcaa 
322 acttcacgag agcctaatac tcgaagacca aaaatagata atctctatga cgtattcatg 
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324 aacattagag atgacgaagc agagcattgt aaaacgatga aagcctgtca aactcacggg 1080 
326 agccttcgt? ctccacacac agatccatgc gatgattctg aagatgatac agggtgttcc 1140 
III gfacctcaag ctgattgtat aggtatcgtg gattgtataa agaagtcag \™ 
330 caagtaacca aaaggtagga aaaggaaaaa cgcggacaaa ctatacttgt atatactagt 

332 atagacaaaa aaaaaaaaaa aaaa 

335 <210> SEQ ID NO: 6 

336 <211> LENGTH: 19 

337 <212> TYPE: DNA 

C--> 338 <213> ORGANISM: Artificial 

340 <220> FEATURE : S 

341 <223> OTHER INFORMATION: PCR primer^ 

343 <400> SEQUENCE: 6 19 

344 gcaacgattt tgcaagacg 

347 <210> SEQ ID NO : 7 

348 <211> LENGTH: 24 

349 <212> TYPE: DNA 

C --> 350 <213> ORGANISM: Artificial 

352 <220> FEATURE: 

353 <223> OTHER INFORMATION: PCR primer^ 

355 <400> SEQUENCE: 7 24 

356 ttaacttgta atggatttct tgag 

359 <210> SEQ ID NO: 8 

360 <211> LENGTH: 171 

361 <212> TYPE: PRT 

362 <213> ORGANISM: soybean 



III Is Leu Leu W He pro Thr i3 p Leu Phe Phe L y s 

if>i 1 5 1^ 

III Arg Arg Tyr Gly Cys Arg Ala Met Met Leu Glu Thr Val Ala Ala Val 
370 20 



372 Pro Gly Met Val Gly Gly Met Leu Leu His Leu Arg Ser Leu Arg Lys 



375 Phe Gin Gin Ser Gly Gly Trp He Lys Ala Leu Leu Glu Glu Ala Glu 
^7fi sn 55 60 

378 Asn Glu Arg Met His Leu Met Thr Met Val Glu Leu Val Lys Pro Lys 
381 Trp Tyr Glu Arg Leu Leu Val Leu Ala Val Gin Gly Val Phe Phe Asn 

384 Ala Phe Phe Val Leu Tyr lie Leu Ser Pro Lys Val Ala His Arg He 

385 100 IO 5 110 



387 Val Gly Tyr Leu Glu Glu Glu Ala He His Ser Tyr Thr Glu Tyr Leu 

Ala 
135 



388 115 I 20 125 

390 Lys Asp 

393 Ala lie Asp Tyr Tr P Arg Leu Pro Lys Asp Ala Arg Leu Lys Asp Val 



390 Lys Asp Leu Glu Ser Gly Ala He Glu Asn Val Pro Ala Pro Ala lie 

i m 140 



394 145 



150 155 



He Thr Val He Arg Ala Asp Glu Ala His His 
165 170 



396 
397 

399 <210> SEQ ID NO: 9 
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L-338 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID# : 6 
L ":350 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:7 
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